Overview

Dataset info

Number of variables22
Number of observations8950
Missing cells0 (0.0%)
Duplicate rows0 (0.0%)
Total size in memory1.5 MiB
Average record size in memory176.0 B

Variables types

Numeric18
Categorical1
Boolean0
Date0
URL0
Text (Unique)0
Rejected3
Unsupported0

Warnings

CASH_ADV_AMOUNT has 4628 (51.7%) zeros Zeros
CASH_ADVANCE is highly correlated with CASH_ADV_AMOUNT (ρ = 0.9763639831) Rejected
CASH_ADVANCE_FREQUENCY has 4628 (51.7%) zeros Zeros
CASH_ADVANCE_TRX has 4628 (51.7%) zeros Zeros
INSTALLMENTS_PURCHASES has 3916 (43.8%) zeros Zeros
MONTHLY_AVG_PURCHASE has 2044 (22.8%) zeros Zeros
ONEOFF_PURCHASES is highly correlated with MONTHLY_AVG_PURCHASE (ρ = 0.9130598274) Rejected
ONEOFF_PURCHASES_FREQUENCY has 4302 (48.1%) zeros Zeros
PAYMENT_MIN_PAY is highly skewed (γ1 = 43.00419578) Skewed
PAYMENT_MIN_PAY has 240 (2.7%) zeros Zeros
PAYMENTS has 240 (2.7%) zeros Zeros
PRC_FULL_PAYMENT has 5903 (66.0%) zeros Zeros
PURCHASES is highly correlated with ONEOFF_PURCHASES (ρ = 0.9168445587) Rejected
PURCHASES_FREQUENCY has 2043 (22.8%) zeros Zeros
PURCHASES_INSTALLMENTS_FREQUENCY has 3915 (43.7%) zeros Zeros
PURCHASES_TRX has 2044 (22.8%) zeros Zeros

Variables

BALANCE
Numeric

Distinct count8871
Unique (%)99.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1564.474828
Minimum0
Maximum19043.13856
Zeros (%)0.9%
Mini histogram

Quantile statistics

Minimum0
5-th percentile8.81451835
Q1128.2819155
Median873.385231
Q32054.140036
95-th percentile5909.111808
Maximum19043.13856
Range19043.13856
Interquartile range1925.85812

Descriptive statistics

Standard deviation2081.531879
Coef of variation1.330498799
Kurtosis7.6747513
Mean1564.474828
MAD1459.747302
Skewness2.393386043
Sum14002049.71
Variance4332774.965
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000000e+00 9.95000000e-05 8.16750000e-03 8.84690150e+00 2.89368260e+01 ... 6.07344107e+03 8.11840880e+03 9.65527800e+03 1.25372974e+04 1.90431386e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 80 0.9%
 
1100.941072 1 < 0.1%
 
40.074484 1 < 0.1%
 
2093.844656 1 < 0.1%
 
179.765708 1 < 0.1%
 
12.654903 1 < 0.1%
 
1893.704851 1 < 0.1%
 
1571.218695 1 < 0.1%
 
31.285608 1 < 0.1%
 
1772.323491 1 < 0.1%
 
Other values (8861) 8861 99.0%
 

Minimum 5 values

ValueCountFrequency (%) 
0 80 0.9%
 
0.000199 1 < 0.1%
 
0.001146 1 < 0.1%
 
0.001214 1 < 0.1%
 
0.001289 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
19043.13856 1 < 0.1%
 
18495.55855 1 < 0.1%
 
16304.88925 1 < 0.1%
 
16259.44857 1 < 0.1%
 
16115.5964 1 < 0.1%
 

BALANCE_FREQUENCY
Numeric

Distinct count43
Unique (%)0.5%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.8772707256
Minimum0
Maximum1
Zeros (%)0.9%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0.272727
Q10.888889
Median1
Q31
95-th percentile1
Maximum1
Range1
Interquartile range0.111111

Descriptive statistics

Standard deviation0.2369040027
Coef of variation0.2700466296
Kurtosis3.092369622
Mean0.8772707256
MAD0.1736723384
Skewness-2.023265519
Sum7851.572994
Variance0.05612350649
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=43)
Histogram
Histogram with variable size bins (bins=[0. 0.0954545 0.1742425 0.190909 0.2613635 ... 0.8257575 0.8819445 0.9045455 0.9545455 1. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 6211 69.4%
 
0.909091 410 4.6%
 
0.818182 278 3.1%
 
0.727273 223 2.5%
 
0.545455 219 2.4%
 
0.636364 209 2.3%
 
0.454545 172 1.9%
 
0.363636 170 1.9%
 
0.272727 151 1.7%
 
0.181818 146 1.6%
 
Other values (33) 761 8.5%
 

Minimum 5 values

ValueCountFrequency (%) 
0 80 0.9%
 
0.090909 67 0.7%
 
0.1 8 0.1%
 
0.111111 5 0.1%
 
0.125 9 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
1 6211 69.4%
 
0.909091 410 4.6%
 
0.9 55 0.6%
 
0.888889 53 0.6%
 
0.875 57 0.6%
 

CASH_ADV_AMOUNT
Numeric

Distinct count4323
Unique (%)48.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean88.97798368
Minimum0
Maximum3928.10098
Zeros (%)51.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q399.08519648
95-th percentile425.5485622
Maximum3928.10098
Range3928.10098
Interquartile range99.08519648

Descriptive statistics

Standard deviation193.1361147
Coef of variation2.170605657
Kurtosis44.18167149
Mean88.97798368
MAD115.3419645
Skewness4.944361178
Sum796352.9539
Variance37301.55882
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000000e+00 6.46464364e-01 1.50692421e+00 1.66356808e+00 3.01886112e+00 ... 5.61295807e+02 9.07690328e+02 1.28674849e+03 1.90819165e+03 3.92810098e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 4628 51.7%
 
16.14484475 1 < 0.1%
 
251.4308613 1 < 0.1%
 
217.2942326 1 < 0.1%
 
115.4271851 1 < 0.1%
 
161.2852262 1 < 0.1%
 
118.7035502 1 < 0.1%
 
163.2511244 1 < 0.1%
 
628.5148048 1 < 0.1%
 
49.961333 1 < 0.1%
 
Other values (4313) 4313 48.2%
 

Minimum 5 values

ValueCountFrequency (%) 
0 4628 51.7%
 
1.292928727 1 < 0.1%
 
1.503564 1 < 0.1%
 
1.510284417 1 < 0.1%
 
1.510556917 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
3928.10098 1 < 0.1%
 
2693.865496 1 < 0.1%
 
2440.175763 1 < 0.1%
 
2381.277231 1 < 0.1%
 
2274.707147 1 < 0.1%
 

CASH_ADVANCE
Highly correlated

This variable is highly correlated with CASH_ADV_AMOUNT and should be ignored for analysis

Correlation0.9763639831

CASH_ADVANCE_FREQUENCY
Numeric

Distinct count54
Unique (%)0.6%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.1351442003
Minimum0
Maximum1.5
Zeros (%)51.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30.222222
95-th percentile0.583333
Maximum1.5
Range1.5
Interquartile range0.222222

Descriptive statistics

Standard deviation0.2001213881
Coef of variation1.480798937
Kurtosis3.334734328
Mean0.1351442003
MAD0.1528463515
Skewness1.828686266
Sum1209.540593
Variance0.04004856999
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0. 0.0416665 0.087121 0.0954545 0.154762 ... 0.763889 0.8257575 0.845238 1.0454545 1.5 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 4628 51.7%
 
0.083333 1021 11.4%
 
0.166667 759 8.5%
 
0.25 578 6.5%
 
0.333333 439 4.9%
 
0.416667 273 3.1%
 
0.5 215 2.4%
 
0.583333 142 1.6%
 
0.666667 125 1.4%
 
0.090909 70 0.8%
 
Other values (44) 700 7.8%
 

Minimum 5 values

ValueCountFrequency (%) 
0 4628 51.7%
 
0.083333 1021 11.4%
 
0.090909 70 0.8%
 
0.1 39 0.4%
 
0.111111 29 0.3%
 

Maximum 5 values

ValueCountFrequency (%) 
1.5 1 < 0.1%
 
1.25 1 < 0.1%
 
1.166667 2 < 0.1%
 
1.142857 1 < 0.1%
 
1.125 1 < 0.1%
 

CASH_ADVANCE_TRX
Numeric

Distinct count65
Unique (%)0.7%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean3.248826816
Minimum0
Maximum123
Zeros (%)51.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q34
95-th percentile15
Maximum123
Range123
Interquartile range4

Descriptive statistics

Standard deviation6.824646744
Coef of variation2.100649598
Kurtosis61.64686248
Mean3.248826816
MAD4.002914191
Skewness5.721298203
Sum29077
Variance46.57580318
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 4.5 ... 17.5 24.5 31.5 52.5 123. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 4628 51.7%
 
1 887 9.9%
 
2 620 6.9%
 
3 436 4.9%
 
4 384 4.3%
 
5 308 3.4%
 
6 246 2.7%
 
7 205 2.3%
 
8 171 1.9%
 
10 150 1.7%
 
Other values (55) 915 10.2%
 

Minimum 5 values

ValueCountFrequency (%) 
0 4628 51.7%
 
1 887 9.9%
 
2 620 6.9%
 
3 436 4.9%
 
4 384 4.3%
 

Maximum 5 values

ValueCountFrequency (%) 
123 3 < 0.1%
 
110 1 < 0.1%
 
107 1 < 0.1%
 
93 1 < 0.1%
 
80 1 < 0.1%
 

CREDIT_LIMIT
Numeric

Distinct count205
Unique (%)2.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean4494.282473
Minimum50
Maximum30000
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum50
5-th percentile1000
Q11600
Median3000
Q36500
95-th percentile12000
Maximum30000
Range29950
Interquartile range4900

Descriptive statistics

Standard deviation3638.646702
Coef of variation0.809616824
Kurtosis2.837370718
Mean4494.282473
MAD2838.84651
Skewness1.52263595
Sum40223828.13
Variance13239749.82
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 50. 475. 550. 625. 675. ... 14800. 15250. 18250. 20250. 30000.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3000 785 8.8%
 
1500 722 8.1%
 
1200 621 6.9%
 
1000 614 6.9%
 
2500 612 6.8%
 
4000 506 5.7%
 
6000 463 5.2%
 
5000 389 4.3%
 
2000 371 4.1%
 
7500 277 3.1%
 
Other values (195) 3590 40.1%
 

Minimum 5 values

ValueCountFrequency (%) 
50 1 < 0.1%
 
150 5 0.1%
 
200 3 < 0.1%
 
300 14 0.2%
 
400 3 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
30000 2 < 0.1%
 
28000 1 < 0.1%
 
25000 1 < 0.1%
 
23000 2 < 0.1%
 
22500 1 < 0.1%
 

INSTALLMENTS_PURCHASES
Numeric

Distinct count4452
Unique (%)49.7%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean411.0676447
Minimum0
Maximum22500
Zeros (%)43.8%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median89
Q3468.6375
95-th percentile1750.0875
Maximum22500
Range22500
Interquartile range468.6375

Descriptive statistics

Standard deviation904.3381152
Coef of variation2.199973963
Kurtosis96.57517753
Mean411.0676447
MAD482.8530134
Skewness7.299119909
Sum3679055.42
Variance817827.4266
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.000000e+00 9.750000e-01 9.430000e+00 4.485000e+01 9.998000e+01 ... 3.207565e+03 4.290080e+03 7.058560e+03 1.296145e+04 2.250000e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 3916 43.8%
 
100 14 0.2%
 
300 14 0.2%
 
200 14 0.2%
 
150 12 0.1%
 
125 11 0.1%
 
75 9 0.1%
 
225 8 0.1%
 
350 8 0.1%
 
450 8 0.1%
 
Other values (4442) 4936 55.2%
 

Minimum 5 values

ValueCountFrequency (%) 
0 3916 43.8%
 
1.95 1 < 0.1%
 
4.44 1 < 0.1%
 
4.8 1 < 0.1%
 
6.33 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
22500 1 < 0.1%
 
15497.19 1 < 0.1%
 
14686.1 1 < 0.1%
 
13184.43 1 < 0.1%
 
12738.47 1 < 0.1%
 

LIMIT_USAGE
Numeric

Distinct count8871
Unique (%)99.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.3888836388
Minimum0
Maximum15.90995114
Zeros (%)0.9%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0.002942983188
Q10.041493866
Median0.3027203298
Q30.7175712839
95-th percentile0.9666855542
Maximum15.90995114
Range15.90995114
Interquartile range0.6760774179

Descriptive statistics

Standard deviation0.3897215686
Coef of variation1.002154706
Kurtosis279.6098891
Mean0.3888836388
MAD0.314582534
Skewness7.416990716
Sum3480.508567
Variance0.151882901
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000000e+00 2.48750000e-08 2.11900000e-06 3.60619252e-03 1.12257443e-02 ... 1.02375535e+00 1.09695299e+00 1.32203693e+00 2.17950097e+00 1.59099511e+01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 80 0.9%
 
0.6244706578 1 < 0.1%
 
0.9751073289 1 < 0.1%
 
0.028122261 1 < 0.1%
 
0.176516258 1 < 0.1%
 
0.03877549481 1 < 0.1%
 
0.03087907427 1 < 0.1%
 
0.04338096267 1 < 0.1%
 
0.7487229662 1 < 0.1%
 
0.7930285695 1 < 0.1%
 
Other values (8861) 8861 99.0%
 

Minimum 5 values

ValueCountFrequency (%) 
0 80 0.9%
 
4.975e-08 1 < 0.1%
 
1.618666667e-07 1 < 0.1%
 
7.64e-07 1 < 0.1%
 
8.593333333e-07 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
15.90995114 1 < 0.1%
 
2.325421833 1 < 0.1%
 
2.033580108 1 < 0.1%
 
1.718885963 1 < 0.1%
 
1.570210437 1 < 0.1%
 

MINIMUM_PAYMENTS
Numeric

Distinct count8636
Unique (%)96.5%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean844.9067666
Minimum0.019163
Maximum76406.20752
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum0.019163
5-th percentile74.6441173
Q1170.8576542
Median312.343947
Q3788.7135007
95-th percentile2719.566935
Maximum76406.20752
Range76406.18836
Interquartile range617.8558465

Descriptive statistics

Standard deviation2332.792322
Coef of variation2.761005609
Kurtosis293.7202868
Mean844.9067666
MAD847.6727246
Skewness13.8524465
Sum7561915.561
Variance5441920.016
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[1.91630000e-02 3.15714000e-01 1.34702655e+01 1.49988500e+01 4.10306825e+01 ... 4.81588476e+03 7.47840380e+03 1.39163944e+04 2.97741176e+04 7.64062075e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
312.343947 314 3.5%
 
299.351881 2 < 0.1%
 
140.596138 1 < 0.1%
 
111.691332 1 < 0.1%
 
129.682608 1 < 0.1%
 
872.760983 1 < 0.1%
 
144.896661 1 < 0.1%
 
176.765922 1 < 0.1%
 
1666.085318 1 < 0.1%
 
125.3494 1 < 0.1%
 
Other values (8626) 8626 96.4%
 

Minimum 5 values

ValueCountFrequency (%) 
0.019163 1 < 0.1%
 
0.037744 1 < 0.1%
 
0.05588 1 < 0.1%
 
0.059481 1 < 0.1%
 
0.117036 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
76406.20752 1 < 0.1%
 
61031.6186 1 < 0.1%
 
56370.04117 1 < 0.1%
 
50260.75947 1 < 0.1%
 
43132.72823 1 < 0.1%
 

MONTHLY_AVG_PURCHASE
Numeric

Distinct count6289
Unique (%)70.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean86.17517288
Minimum0
Maximum4086.630833
Zeros (%)22.8%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q13.399375
Median31.93666667
Q397.22833333
95-th percentile339.2500417
Maximum4086.630833
Range4086.630833
Interquartile range93.82895833

Descriptive statistics

Standard deviation180.5087866
Coef of variation2.094672752
Kurtosis107.8007683
Mean86.17517288
MAD92.12093085
Skewness8.004529658
Sum771267.7973
Variance32583.42202
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000000e+00 4.16666667e-04 8.71212121e-04 9.03333333e-01 3.79916667e+00 ... 5.01547083e+02 8.16187917e+02 1.07491708e+03 2.32283750e+03 4.08663083e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 2044 22.8%
 
3.804166667 25 0.3%
 
12.5 14 0.2%
 
50 13 0.1%
 
5 13 0.1%
 
25 12 0.1%
 
8.333333333 12 0.1%
 
10 10 0.1%
 
16.66666667 10 0.1%
 
9.5 9 0.1%
 
Other values (6279) 6788 75.8%
 

Minimum 5 values

ValueCountFrequency (%) 
0 2044 22.8%
 
0.0008333333333 3 < 0.1%
 
0.0009090909091 1 < 0.1%
 
0.004166666667 1 < 0.1%
 
0.02 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
4086.630833 1 < 0.1%
 
3420.866667 1 < 0.1%
 
3336.725833 1 < 0.1%
 
3241.8925 1 < 0.1%
 
2927.596667 1 < 0.1%
 

ONEOFF_PURCHASES
Highly correlated

This variable is highly correlated with MONTHLY_AVG_PURCHASE and should be ignored for analysis

Correlation0.9130598274

ONEOFF_PURCHASES_FREQUENCY
Numeric

Distinct count47
Unique (%)0.5%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.2024576836
Minimum0
Maximum1
Zeros (%)48.1%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0.083333
Q30.3
95-th percentile1
Maximum1
Range1
Interquartile range0.3

Descriptive statistics

Standard deviation0.2983360652
Coef of variation1.473572452
Kurtosis1.161845601
Mean0.2024576836
MAD0.2329477937
Skewness1.535612784
Sum1811.996268
Variance0.08900440779
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=47)
Histogram
Histogram with variable size bins (bins=[0. 0.0416665 0.087121 0.1055555 0.154762 ... 0.8257575 0.845238 0.912879 0.9583335 1. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 4302 48.1%
 
0.083333 1104 12.3%
 
0.166667 592 6.6%
 
1 481 5.4%
 
0.25 418 4.7%
 
0.333333 355 4.0%
 
0.416667 244 2.7%
 
0.5 235 2.6%
 
0.583333 197 2.2%
 
0.666667 167 1.9%
 
Other values (37) 855 9.6%
 

Minimum 5 values

ValueCountFrequency (%) 
0 4302 48.1%
 
0.083333 1104 12.3%
 
0.090909 56 0.6%
 
0.1 39 0.4%
 
0.111111 26 0.3%
 

Maximum 5 values

ValueCountFrequency (%) 
1 481 5.4%
 
0.916667 151 1.7%
 
0.909091 4 < 0.1%
 
0.9 1 < 0.1%
 
0.888889 2 < 0.1%
 

PAYMENT_MIN_PAY
Numeric

Distinct count8711
Unique (%)97.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean9.059164108
Minimum0
Maximum6840.528861
Zeros (%)2.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0.2773658108
Q10.9132754946
Median2.032716823
Q36.052728819
95-th percentile20.95323521
Maximum6840.528861
Range6840.528861
Interquartile range5.139453325

Descriptive statistics

Standard deviation118.1805256
Coef of variation13.04541172
Kurtosis2078.956312
Mean9.059164108
MAD11.09640939
Skewness43.00419578
Sum81079.51876
Variance13966.63663
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000000e+00 3.65211523e-04 2.26087840e-02 3.20092389e-01 4.52632418e-01 ... 5.18645353e+01 7.02849126e+01 1.50124688e+02 8.44736764e+02 6.84052886e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 240 2.7%
 
8.542200201 1 < 0.1%
 
0.9717347667 1 < 0.1%
 
8.096440639 1 < 0.1%
 
0.6752038902 1 < 0.1%
 
8.435840921 1 < 0.1%
 
0.5262525252 1 < 0.1%
 
15.2517368 1 < 0.1%
 
0.4904814866 1 < 0.1%
 
64.8573273 1 < 0.1%
 
Other values (8701) 8701 97.2%
 

Minimum 5 values

ValueCountFrequency (%) 
0 240 2.7%
 
0.0007304230455 1 < 0.1%
 
0.001847673413 1 < 0.1%
 
0.002481982882 1 < 0.1%
 
0.003507330989 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
6840.528861 1 < 0.1%
 
5367.043208 1 < 0.1%
 
4707.141559 1 < 0.1%
 
2846.533661 1 < 0.1%
 
2470.504187 1 < 0.1%
 

PAYMENTS
Numeric

Distinct count8711
Unique (%)97.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1733.143852
Minimum0
Maximum50721.48336
Zeros (%)2.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile89.98892395
Q1383.276166
Median856.901546
Q31901.134317
95-th percentile6082.090595
Maximum50721.48336
Range50721.48336
Interquartile range1517.858151

Descriptive statistics

Standard deviation2895.063757
Coef of variation1.670411693
Kurtosis54.77073581
Mean1733.143852
MAD1553.741531
Skewness5.907619794
Sum15511637.48
Variance8381394.157
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[0.00000000e+00 2.47565000e-02 5.46557440e+01 1.49249077e+02 3.64735164e+02 ... 9.00345148e+03 1.17195868e+04 1.44720480e+04 2.30845738e+04 5.07214834e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 240 2.7%
 
806.587482 1 < 0.1%
 
836.812414 1 < 0.1%
 
139.607827 1 < 0.1%
 
107.242408 1 < 0.1%
 
433.860714 1 < 0.1%
 
402.20014 1 < 0.1%
 
308.886492 1 < 0.1%
 
238.695115 1 < 0.1%
 
197.86349 1 < 0.1%
 
Other values (8701) 8701 97.2%
 

Minimum 5 values

ValueCountFrequency (%) 
0 240 2.7%
 
0.049513 1 < 0.1%
 
0.056466 1 < 0.1%
 
2.389583 1 < 0.1%
 
3.500505 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
50721.48336 1 < 0.1%
 
46930.59824 1 < 0.1%
 
40627.59524 1 < 0.1%
 
39461.9658 1 < 0.1%
 
39048.59762 1 < 0.1%
 

PRC_FULL_PAYMENT
Numeric

Distinct count47
Unique (%)0.5%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.1537146485
Minimum0
Maximum1
Zeros (%)66.0%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0
Q30.142857
95-th percentile1
Maximum1
Range1
Interquartile range0.142857

Descriptive statistics

Standard deviation0.2924991962
Coef of variation1.902871321
Kurtosis2.432395301
Mean0.1537146485
MAD0.2137870147
Skewness1.942819941
Sum1375.746104
Variance0.0855557798
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=47)
Histogram
Histogram with variable size bins (bins=[0. 0.0416665 0.087121 0.0954545 0.1055555 ... 0.8257575 0.845238 0.8944445 0.9583335 1. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 5903 66.0%
 
1 488 5.5%
 
0.083333 426 4.8%
 
0.166667 166 1.9%
 
0.25 156 1.7%
 
0.5 156 1.7%
 
0.090909 153 1.7%
 
0.333333 134 1.5%
 
0.1 94 1.1%
 
0.2 83 0.9%
 
Other values (37) 1191 13.3%
 

Minimum 5 values

ValueCountFrequency (%) 
0 5903 66.0%
 
0.083333 426 4.8%
 
0.090909 153 1.7%
 
0.1 94 1.1%
 
0.111111 61 0.7%
 

Maximum 5 values

ValueCountFrequency (%) 
1 488 5.5%
 
0.916667 77 0.9%
 
0.909091 19 0.2%
 
0.9 16 0.2%
 
0.888889 12 0.1%
 

PURCHASE_TYPE
Categorical

Distinct count4
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
oneoff_installment
2774
installment
2260
none
2042
ValueCountFrequency (%) 
oneoff_installment 2774 31.0%
 
installment 2260 25.3%
 
none 2042 22.8%
 
oneoff 1874 20.9%
 
Max length18
Mean length10.52558659
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

PURCHASES
Highly correlated

This variable is highly correlated with ONEOFF_PURCHASES and should be ignored for analysis

Correlation0.9168445587

PURCHASES_FREQUENCY
Numeric

Distinct count47
Unique (%)0.5%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.4903505484
Minimum0
Maximum1
Zeros (%)22.8%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10.083333
Median0.5
Q30.916667
95-th percentile1
Maximum1
Range1
Interquartile range0.833334

Descriptive statistics

Standard deviation0.4013707474
Coef of variation0.8185383879
Kurtosis-1.638630948
Mean0.4903505484
MAD0.368256228
Skewness0.06016423586
Sum4388.637408
Variance0.1610984768
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=47)
Histogram
Histogram with variable size bins (bins=[0. 0.0416665 0.087121 0.0954545 0.154762 ... 0.845238 0.8944445 0.912879 0.9583335 1. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 2178 24.3%
 
0 2043 22.8%
 
0.083333 677 7.6%
 
0.916667 396 4.4%
 
0.5 395 4.4%
 
0.166667 392 4.4%
 
0.833333 373 4.2%
 
0.333333 367 4.1%
 
0.25 345 3.9%
 
0.583333 316 3.5%
 
Other values (37) 1468 16.4%
 

Minimum 5 values

ValueCountFrequency (%) 
0 2043 22.8%
 
0.083333 677 7.6%
 
0.090909 43 0.5%
 
0.1 27 0.3%
 
0.111111 18 0.2%
 

Maximum 5 values

ValueCountFrequency (%) 
1 2178 24.3%
 
0.916667 396 4.4%
 
0.909091 28 0.3%
 
0.9 24 0.3%
 
0.888889 18 0.2%
 

PURCHASES_INSTALLMENTS_FREQUENCY
Numeric

Distinct count47
Unique (%)0.5%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean0.3644373416
Minimum0
Maximum1
Zeros (%)43.7%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q10
Median0.166667
Q30.75
95-th percentile1
Maximum1
Range1
Interquartile range0.75

Descriptive statistics

Standard deviation0.3974477797
Coef of variation1.090579187
Kurtosis-1.398632185
Mean0.3644373416
MAD0.361672884
Skewness0.509201165
Sum3261.714207
Variance0.1579647376
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=47)
Histogram
Histogram with variable size bins (bins=[0. 0.0416665 0.087121 0.1180555 0.154762 ... 0.8257575 0.845238 0.912879 0.9583335 1. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 3915 43.7%
 
1 1331 14.9%
 
0.416667 388 4.3%
 
0.916667 345 3.9%
 
0.833333 311 3.5%
 
0.5 310 3.5%
 
0.166667 305 3.4%
 
0.666667 292 3.3%
 
0.75 291 3.3%
 
0.083333 275 3.1%
 
Other values (37) 1187 13.3%
 

Minimum 5 values

ValueCountFrequency (%) 
0 3915 43.7%
 
0.083333 275 3.1%
 
0.090909 12 0.1%
 
0.1 6 0.1%
 
0.111111 9 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
1 1331 14.9%
 
0.916667 345 3.9%
 
0.909091 25 0.3%
 
0.9 19 0.2%
 
0.888889 28 0.3%
 

PURCHASES_TRX
Numeric

Distinct count173
Unique (%)1.9%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean14.7098324
Minimum0
Maximum358
Zeros (%)22.8%
Mini histogram

Quantile statistics

Minimum0
5-th percentile0
Q11
Median7
Q317
95-th percentile57
Maximum358
Range358
Interquartile range16

Descriptive statistics

Standard deviation24.85764911
Coef of variation1.689866236
Kurtosis34.79310026
Mean14.7098324
MAD14.64307743
Skewness4.630655266
Sum131653
Variance617.9027193
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 0. 0.5 1.5 2.5 8.5 ... 98.5 122.5 159.5 230.5 358. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 2044 22.8%
 
1 667 7.5%
 
12 570 6.4%
 
2 379 4.2%
 
6 352 3.9%
 
3 314 3.5%
 
4 285 3.2%
 
7 275 3.1%
 
8 267 3.0%
 
5 267 3.0%
 
Other values (163) 3530 39.4%
 

Minimum 5 values

ValueCountFrequency (%) 
0 2044 22.8%
 
1 667 7.5%
 
2 379 4.2%
 
3 314 3.5%
 
4 285 3.2%
 

Maximum 5 values

ValueCountFrequency (%) 
358 1 < 0.1%
 
347 1 < 0.1%
 
344 1 < 0.1%
 
309 1 < 0.1%
 
308 1 < 0.1%
 

TENURE
Numeric

Distinct count7
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean11.51731844
Minimum6
Maximum12
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum6
5-th percentile8
Q112
Median12
Q312
95-th percentile12
Maximum12
Range6
Interquartile range0

Descriptive statistics

Standard deviation1.338330769
Coef of variation0.1162015947
Kurtosis7.694823186
Mean11.51731844
MAD0.8180239069
Skewness-2.943017288
Sum103080
Variance1.791129248
Memory size70.0 KiB
Histogram
Histogram with fixed size bins (bins=7)
Histogram
Histogram with variable size bins (bins=[ 6. 6.5 9.5 10.5 11.5 12. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
12 7584 84.7%
 
11 365 4.1%
 
10 236 2.6%
 
6 204 2.3%
 
8 196 2.2%
 
7 190 2.1%
 
9 175 2.0%
 

Minimum 5 values

ValueCountFrequency (%) 
6 204 2.3%
 
7 190 2.1%
 
8 196 2.2%
 
9 175 2.0%
 
10 236 2.6%
 

Maximum 5 values

ValueCountFrequency (%) 
12 7584 84.7%
 
11 365 4.1%
 
10 236 2.6%
 
9 175 2.0%
 
8 196 2.2%
 

Correlations

Missing values

Sample

First rows

BALANCEBALANCE_FREQUENCYCASH_ADV_AMOUNTCASH_ADVANCECASH_ADVANCE_FREQUENCYCASH_ADVANCE_TRXCREDIT_LIMITINSTALLMENTS_PURCHASESLIMIT_USAGEMINIMUM_PAYMENTSMONTHLY_AVG_PURCHASEONEOFF_PURCHASESONEOFF_PURCHASES_FREQUENCYPAYMENT_MIN_PAYPAYMENTSPRC_FULL_PAYMENTPURCHASE_TYPEPURCHASESPURCHASES_FREQUENCYPURCHASES_INSTALLMENTS_FREQUENCYPURCHASES_TRXTENURE
040.9007490.8181820.0000000.0000000.00000001000.095.400.040901139.5097877.9500000.000.0000001.446508201.8020840.000000installment95.400.1666670.083333212
13202.4674160.909091536.9121246442.9454830.25000047000.00.000.4574951072.3402170.0000000.000.0000003.8262414103.0325970.222222none0.000.0000000.000000012
22495.1488621.0000000.0000000.0000000.00000007500.00.000.332687627.28478764.430833773.171.0000000.991682622.0667420.000000oneoff773.171.0000000.0000001212
31666.6705420.63636417.149001205.7880170.08333317500.00.000.222223312.343947124.9166671499.000.0833330.0000000.0000000.000000oneoff1499.000.0833330.000000112
4817.7143351.0000000.0000000.0000000.00000001200.00.000.681429244.7912371.33333316.000.0833332.771075678.3347630.000000oneoff16.000.0833330.000000112
51809.8287511.0000000.0000000.0000000.00000001800.01333.281.0054602407.246035111.1066670.000.0000000.5816011400.0577700.000000installment1333.280.6666670.583333812
6627.2608061.0000000.0000000.0000000.000000013500.0688.380.046464198.065894590.9175006402.631.00000032.0818206354.3143281.000000oneoff_installment7091.011.0000001.0000006412
71823.6527431.0000000.0000000.0000000.00000002300.0436.200.792892532.03399036.3500000.000.0000001.276357679.0650820.000000installment436.201.0000001.0000001212
81014.9264731.0000000.0000000.0000000.00000007000.0200.000.144989311.96340971.790833661.490.0833332.206280688.2785680.000000oneoff_installment861.490.3333330.250000512
9152.2259750.5454550.0000000.0000000.000000011000.00.000.013839100.302262106.8000001281.600.16666711.6126051164.7705910.000000oneoff1281.600.1666670.000000312

Last rows

BALANCEBALANCE_FREQUENCYCASH_ADV_AMOUNTCASH_ADVANCECASH_ADVANCE_FREQUENCYCASH_ADVANCE_TRXCREDIT_LIMITINSTALLMENTS_PURCHASESLIMIT_USAGEMINIMUM_PAYMENTSMONTHLY_AVG_PURCHASEONEOFF_PURCHASESONEOFF_PURCHASES_FREQUENCYPAYMENT_MIN_PAYPAYMENTSPRC_FULL_PAYMENTPURCHASE_TYPEPURCHASESPURCHASES_FREQUENCYPURCHASES_INSTALLMENTS_FREQUENCYPURCHASES_TRXTENURE
8940130.8385541.0000000.0000000.0000000.00000001000.0591.240.13083982.77132098.5400000.000.0000005.745025475.5232621.00installment591.241.0000000.83333366
89415967.4752700.8333331425.9015548555.4093260.666667139000.0214.550.663053861.94990635.7583330.000.0000001.120950966.2029120.00installment214.550.8333330.66666756
894240.8297491.0000000.0000000.0000000.00000001000.0113.280.04083086.28310118.8800000.000.0000001.09510294.4888280.25installment113.281.0000000.83333366
89435.8717120.5000000.0000000.0000000.0000000500.00.000.01174343.4737173.48333320.900.1666671.34897358.6448830.00oneoff20.900.1666670.00000016
8944193.5717220.8333330.0000000.0000000.00000004000.00.000.048393312.343947168.7883331012.730.3333330.0000000.0000000.00oneoff1012.730.3333330.00000026
894528.4935171.0000000.0000000.0000000.00000001000.0291.120.02849448.88636548.5200000.000.0000006.660231325.5944620.50installment291.121.0000000.83333366
894619.1832151.0000000.0000000.0000000.00000001000.0300.000.019183312.34394750.0000000.000.0000000.883197275.8613220.00installment300.001.0000000.83333366
894723.3986730.8333330.0000000.0000000.00000001000.0144.400.02339982.41836924.0666670.000.0000000.98607681.2707750.25installment144.400.8333330.66666756
894813.4575640.8333336.09313036.5587780.1666672500.00.000.02691555.7556280.0000000.000.0000000.94250552.5499590.25none0.000.0000000.00000006
8949372.7080750.66666721.173335127.0400080.33333321200.00.000.31059088.288956182.2083331093.250.6666670.71543963.1654040.00oneoff1093.250.6666670.000000236